Topological solution of missing attribute values problem in incomplete information tables

نویسنده

  • A. S. Salama
چکیده

In this paper, we present a method of data decomposition to avoid the necessity of reasoning on data with missing attribute values. We defined a general binary relation on the original incomplete data which decomposed into data subsets without missing values. Next, topological base relation method for classifier induction is applied to such sets. New approach to find the missing values in consistent data is discussed. New pretopological approximations of concepts are studied and some of their properties are proved. Pre-topological measures are initiated in incomplete information systems and more pre-approximations are defined and studied. Finally, the reducts and core are determined.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Rough Set Model Based on Probabilistic Similarity Measure for Incomplete Decision Tables

Rough set models in incomplete decision tables have been discussed so far. Numerous approaches to deal with missing values in incomplete information systems have been proposed. In this paper, assuming that the domain of attribute values is defined, we apply the probability of values appearing in data tables in order to measure the self-information of similarity. This is defined as the uncertain...

متن کامل

Rough Set Approaches to Rule Induction from Incomplete Data

In this paper we assume that data are presented in the form of decision tables, incomplete when some attribute values are missing. Two main cases of missing attribute values are considered: lost (the original value was erased) and "do not care" conditions (the original value was irrelevant). This paper uses, as the main tool, attribute-value pair blocks. These blocks are used to construct chara...

متن کامل

Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction

Data sets, described by decision tables, are incomplete when for some cases (examples, objects) the corresponding attribute values are missing, e.g., are lost or represent “do not care” conditions. This paper shows an extremely useful technique to work with incomplete decision tables using a block of an attribute-value pair. Incomplete decision tables are described by characteristic relations i...

متن کامل

A BAYESIAN APPROACH TO COMPUTING MISSING REGRESSOR VALUES

In this article, Lindley's measure of average information is used to measure the information contained in incomplete observations on the vector of unknown regression coefficients [9]. This measure of information may be used to compute the missing regressor values.

متن کامل

A Comparative Analysis of Multigranular Approaches and on Topoligical Properties of Incomplete Pessimistic Multigranular Rough Fuzzy Sets

Rough sets, introduced by Pawlak as a model to capture impreciseness in data have been a very useful tool in several applications. These basic rough sets are defined by taking equivalence relations over a universe. In order to enhance the modeling powers of rough sets, several extensions to the basic definition has been introduced over the past few years. Extending the single granular structure...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Inf. Sci.

دوره 180  شماره 

صفحات  -

تاریخ انتشار 2010